(54) Method for cleaving chimeric protein using processing enzyme



(57) A chimeric protein is provided which comprises a target protein and has been designed so as to be easily
cleaved by a processing enzyme, permitting the target protein to be efficiently recovered. The chimeric protein
is represented by the following formula:



A-L-B 



wherein A represents a protective peptide; 

B represents a target peptide; and 
L represents a linker peptide having the sequence X1-X2-(Pro, Lys, or Arg)-Arg, wherein X1 and X2 represent
any amino acid, in its C-terminal region and a domain rich in His in its N-terminal region.
Description

The present invention relates to a chimeric protein serving as a substrate in the production of a physiologically
active peptide or a precursor thereof, and a process for producing the physiologically active peptide or a precursor
thereof. Further, the present invention relates to a method for separating the physiologically active peptide from the
chimeric protein to purify the physiologically active peptide.

A number of peptide production processes utilizing the expression of a chimeric protein have been attempted. In
these processes, chemical or enzymatic cleavage is used for separating a target peptide. Enzymes used for this purpose include
lysyl endopeptidase (Achromobacter protease I), which specifically cleaves the peptide bond on the C-terminal side of a
lysine residue, and staphylococcal protease V8, which specifically cleaves on the C-terminal side of a glutamic acid residue. Chemical
methods include cleaving at an asparagine residue by nitrous acid and cleaving at a methionine residue by CNBr.

For the chemical methods, the modification of the target peptide is unavoidable, necessitating the separation of
the target peptide from various analogues in the purification of the target peptide after cleaving. On the other hand, for
the enzymatic cleaving method, the specificity is high, and the cleaving is performed under relatively moderate conditions,
facilitating the purification of the target peptide. Since, however, whether or not an enzyme is usable depends
upon the amino acid sequence of the target peptide, existing proteases cannot always be used, leading to a demand
for universal cleavage enzymes.

When a peptide hormone or a precursor thereof is produced in an organism, a precursor polypeptide for the peptide
is specifically cleaved by an enzyme (a processing enzyme). Examples of such enzymes known in the art include
prohormone convertase 1/3 (PC 1/3), prohormone convertase 2 (PC 2), and furin. Kex2 protease, which functions in the
production of the α-mating factor of Saccharomyces cerevisiae, is also an enzyme of this kind. When these processing
enzymes are used for excising a target peptide from the chimeric protein, it is expected that the peptide hormone
is not damaged and that the processing enzymes are applicable to a wide variety of peptides. Therefore, the development
of such production methods has been desired in the art.

The present inventors have reported a method which comprises providing a polypeptide, derived from E. coli
β-galactosidase, as a protective peptide, selecting a proper linker peptide, and efficiently producing a chimeric protein
with a target peptide, as an insoluble inclusion body (Japanese Unexamined Patent Publication (Kokai) No. 5-328992).
This chimeric protein is solubilized with a denaturant, such as urea, and used as a substrate for processing enzymes.

Non-published Art

On the other hand, the present inventors have disclosed a method wherein Kex2 protease derived from Saccharomyces
cerevisiae, a membrane-bound enzyme, is produced as a soluble enzyme (a secretory Kex2 derivative) by
altering the gene (European patent application claiming the priority of JP-8-73217 and JP-8-352580; title "Process for
Producing Secretory Kex2 Derivative"). No design of a chimeric protein as a substrate in the use of this enzyme as a
processing enzyme has been referred to in the specification, a copy of which is filed herewith.

The development of a method for designing and producing a chimeric protein which can be efficiently produced
as an inclusion body in Escherichia coli (E. coli) under large scale culture conditions and can be easily solubilized
under conditions for permitting a processing enzyme to effectively act, and a method for designing and producing an
amino acid sequence in the vicinity of the cleaving site so as to enable a target peptide to be efficiently cleaved from
the chimeric protein by a processing enzyme, has been desired in the art. Further, the development of a highly versatile
method for purifying the produced physiologically active peptide to a purity of not less than 99% has also been desired
in the art.

The general aim herein is to provide new and useful methods and materials for designing and producing a chimeric
protein serving as a substrate in the production of a physiologically active peptide using a processing enzyme, such
as a secretory Kex2 derivative, and to provide a process for purifying the peptide.

The present inventors have made studies with the above issues in mind and, as a result, have for the first time
clarified the following.

1) Insertion of a sequence rich in His, for example, a linker peptide containing {(His)4-Pro-Gly}n (n = 1 to 6), into
a chimeric protein can lead to increased production and, in addition, increased solubility under conditions for
reacting the chimeric protein with a processing enzyme, such as a secretory Kex2 derivative.

2) A highly reactive substrate requires that the amino acid sequence X1-X2-(Lys/Arg/Pro)-Arg,
wherein X1 preferably represents Lys, Arg, or His and X2 preferably represents His or Phe, be used as the amino
acid sequence in the linker at the site on which the processing enzyme acts.

3) After the reaction, a series of steps of shifting the pH of the reaction solution to the acidic side, diluting the solution
to decrease the concentration of the denaturant so that most of the components other than the target peptide
are precipitated, followed by centrifugation or press filtration, enables 95% or more of the target peptide to be
recovered in the supernatant, and use of, for example, cation exchange chromatography, low-pressure reversed
phase chromatography, reversed phase HPLC and the like, either alone or preferably in combination, enables
the target peptide to be purified to a high purity with a high recovery.

Thus, according to one aspect of the present invention, there is provided a chimeric protein represented by the
following formula:

A-L-B

wherein A represents a protective peptide;
B represents a target peptide; and
L represents a linker peptide having the sequence X1-X2-(Pro, Lys, or Arg)-Arg, wherein X1 and X2 represent
any amino acid, in its C-terminal region and a domain rich in His in its N-terminal region.

Preferably, the domain rich in His in the N-terminal region has the sequence [(His)4-Pro-Gly]n, wherein n is 1 to 6. Further,
preferably, X1 represents Arg, Lys, or His, and X2 represents His or Phe. Any 1 to 5 amino acids may exist between
the amino acid sequence in the N-terminal region and the amino acid sequence in the C-terminal region.
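
The linker definition above can be illustrated with a minimal Python sketch. It assumes one-letter amino acid codes, and the function name `check_linker` and the returned field names are hypothetical, introduced only for this illustration. The sketch looks for one to six tandem (His)4-Pro-Gly repeats and for X1-X2-(Pro/Lys/Arg)-Arg at the C-terminal end, reports the residues lying between them, and flags whether X1 and X2 fall in the preferred sets. It does not enforce the length of the intervening residues, since the text only states that 1 to 5 amino acids may exist there.

```python
import re

# One-letter codes: H = His, P = Pro, G = Gly, K = Lys, R = Arg, F = Phe.
HIS_DOMAIN = re.compile(r"(?:HHHHPG){1,6}")      # [(His)4-Pro-Gly]n, n = 1 to 6
LC_SITE = re.compile(r"([A-Z])([A-Z])[PKR]R$")   # X1-X2-(Pro/Lys/Arg)-Arg at the C-terminus


def check_linker(linker: str) -> dict:
    """Report whether a linker has a His-rich domain followed by the C-terminal site."""
    his = HIS_DOMAIN.search(linker)
    lc = LC_SITE.search(linker)
    if his is None or lc is None or his.end() > lc.start():
        return {"valid": False}
    x1, x2 = lc.group(1), lc.group(2)
    return {
        "valid": True,
        "n_repeats": len(his.group()) // 6,          # each repeat is 6 residues long
        "spacer": linker[his.end():lc.start()],      # residues between the two regions
        "x1": x1,
        "x2": x2,
        "preferred": x1 in "RKH" and x2 in "HF",     # preferred X1 and X2 residues
    }


# Linker adopted later in the description:
# Pro-His-His-His-His-Pro-Gly-Gly-Arg-Pro-Ser-Arg-His-Lys-Arg
result = check_linker("PHHHHPGGRPSRHKR")
```

For the linker adopted later in the description, the sketch finds one repeat, the four intervening residues Gly-Arg-Pro-Ser, and X1 = Arg with X2 = His, both within the preferred sets.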

According to another aspect of the present invention, there is provided a process for producing the above chimeric
protein, comprising the steps of: transforming a host cell with an expression vector containing DNA encoding the chimeric protein;
culturing the resultant transformants; and harvesting the chimeric protein from the culture.

According to a further aspect of the present invention, there is provided a process for producing the above target
peptide (B), wherein a processing enzyme is allowed to act on the above chimeric protein to cleave a peptide bond
between the C-terminal of the linker peptide (L) and the N-terminal of the target peptide (B) to obtain the target peptide
(B).

According to a preferred embodiment of the present invention, the above process comprises the steps of: transforming
a host cell with a vector capable of expressing a gene encoding a chimeric protein; culturing the resultant
transformant; disrupting the transformant to provide an insoluble fraction containing inclusion bodies; treating the insoluble
fraction with a solubilizing agent to solubilize the chimeric protein in the inclusion bodies; cleaving a peptide bond, between
the C-terminal of the linker amino acid residue of the solubilized chimeric protein and the N-terminal of the target
peptide, by a processing enzyme, such as a secretory Kex2 derivative, to separate the target peptide; and purifying
the target peptide by precipitation, cation exchange chromatography, low-pressure reversed phase chromatography, and
reversed phase HPLC.

BRIEF DESCRIPTION OF THE DRAWINGS 


Fig. 1 illustrates the sequence of a synthetic oligomer used in the preparation of a synthetic hProPTH (1-84) gene. 
Fig. 2 is a diagram showing the preparation of a synthetic hProPTH (1-84) gene. 

Fig. 3 is a diagram showing the preparation of a plasmid pG210S (S/X), wherein Plac represents a promoter for 
a lactose operon of E. coli, Ttrp represents an attenuator terminator for TrpE of E. coli. 
Fig. 4 is a diagram showing the preparation of a plasmid pGP#19 capable of expressing a chimeric protein
βGal-139S(FM)PPH84.

Fig. 5 is a diagram showing the preparation of a plasmid pGP#19PPH34 capable of expressing a chimeric protein
βGal-139S(FM)PPH34.

Fig. 6 is a diagram showing the first half of a process for producing plasmids pG117SPPH34 and pG117SnHPPH34
capable of expressing chimeric proteins βGal-117SPPH34 and βGal-117SnHPPH34 (n = 1 to 6), wherein pG97SPPH34
and pG97SnHPPH34 were prepared by substituting the primer S04 for the primer S05 and pG139SPPH34 and
pG139SnHPPH34 were prepared by substituting the primer S06 for the primer S05.

Fig. 7 is a diagram showing the second half of a process for producing plasmids pG117SPPH34 and
pG117SnHPPH34 capable of expressing chimeric proteins βGal-117SPPH34 and βGal-117SnHPPH34 (n = 1 to 6),
wherein pG97SPPH34 and pG97SnHPPH34 were prepared by substituting the primer S04 for the primer S05 and
pG139SPPH34 and pG139SnHPPH34 were prepared by substituting the primer S06 for the primer S05.

Fig. 8 is a diagram showing the results of SDS-PAGE by which the influence of an oligohistidine linker
{(His)4-Pro-Gly}n (n = 1 to 6) on the productivity of a chimeric protein has been investigated, wherein 4H represents that the number
of histidine residues is 4, that is, n = 1.

Fig. 9 is a diagram showing the influence of a polyhistidine linker {(His)4-Pro-Gly}n (n = 1 to 6) on the efficiency of
the cleavage of hPTH (1-34) by Kex2-660, wherein the abscissa represents the relative kcat value with the kcat value
of βGal-117SPPH34 being taken as 1.

Fig. 10 is a diagram showing the influence of an amino acid at Kex2 protease recognition site P3 on the efficiency
of the cleavage of hPTH (1-34) by Kex2-660, the amino acid at the site P4 being fixed as a valine residue
with the amino acid at the site P3 being substituted, wherein the abscissa represents the relative kcat value with the
kcat value of βGal-117S4HVKPH34 being taken as 1.

Fig. 11 is a diagram showing the influence of an amino acid at Kex2 protease recognition site P4 on the efficiency
of the cleavage of hPTH (1-34) by Kex2-660, the amino acid at the site P3 being fixed as a histidine residue
with the amino acid at the site P4 being substituted, wherein the abscissa represents the relative kcat value with the
kcat value of βGal-117S4HVKPH34 being taken as 1.

Fig. 12 is a schematic diagram showing the structure of an expression vector pGKSPH34 for a βGal-117KS4HRHPH34
gene, wherein βGal-117KS is a peptide with the arginine residue at the 14-position of βGal-117S being substituted by
a lysine residue.

Fig. 13 is a diagram showing the preparation of a plasmid pG210ShCT(G).

Fig. 14 shows nucleotide sequences of synthetic oligonucleotides for construction of hGLP-1 (7-37) gene (DNA
(11)), and nucleotide sequences of PCR primers using pGKSPH34 as a template.

Fig. 15 shows a process for construction of plasmid pG117S4HGP.

Fig. 16 shows profiles of purification of hGLP-1, wherein A shows a result for a sample after reaction with Kex2, B
shows a result for a supernatant from acid precipitation, C shows a result for a sample after cation exchange column
chromatography and D shows a result for a sample after low-pressure reversed phase column chromatography. Peak
1 shows hGLP-1 (7-37), peak 2 shows hGLP-1, peak 3 shows a protective peptide and peak 4 shows chimeric protein.

DETAILED DESCRIPTION

According to the method of the present invention, the above chimeric protein comprising a protective peptide, a
linker, and a physiologically active peptide (a target peptide) can be cleaved by a processing enzyme to produce a
physiologically active peptide. Examples of such peptides include, but are not limited to, human parathyroid hormone
(hPTH (1-84)) and derivatives thereof (hPTH (1-34), hPTH (1-37), and hPTH (1-38)), other derivatives of human
parathyroid hormones (hPTH (1-31) Gly, hPTH (1-34) Gly, hPTH (1-37) Gly, hPTH (1-38) Gly, hPTH (3-84), hPTH (3-34),
hPTH (3-37), and hPTH (3-38)), cell growth factors, calcitonin and precursors thereof, glucagon-like peptide 1 (GLP-1)
and derivatives thereof (GLP-1 (1-37), GLP-1 (7-37), GLP-1 (7-36) NH2 and the like).

Processing enzymes usable herein include, but are not limited to, Kex2 protease, furin, prohormone convertase
1 (PC1), prohormone convertase 2 (PC2), and derivatives thereof. In the present invention, secretory Kex2 derivatives
prepared by removing the C-terminal hydrophobic region of Kex2 protease are particularly preferred.

The protective peptide according to the present invention is preferably a peptide having a size of not more than
220 amino acids. For example, a peptide of 90 to 210 amino acids counting from the 1-position of the N-terminal of E.
coli β-galactosidase is preferred. More preferred is a peptide of amino acids 1-97, 1-117, or 1-139 on the N-terminal
side of E. coli β-galactosidase. The above peptides with the cysteine residues being substituted by serine residues are
still more preferred. The above peptides with the arginine residue at the 14-position being substituted by a lysine residue
are most preferred. Further, preferably, the protective peptide does not contain a sequence recognizable by a processing
enzyme, for example, Pro-Arg, Arg-Arg or Lys-Arg. If such a recognition sequence is contained in the protective peptide,
the substitution of at least one amino acid in the recognition sequence with a different amino acid, to render the
sequence unrecognizable by the processing enzyme, can prevent cleaving by the processing enzyme.
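
The check described above, scanning a protective peptide for Pro-Arg, Arg-Arg, or Lys-Arg and removing a site by a single substitution, can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, and `TOY_PEPTIDE` is a made-up 20-residue sequence (not the β-galactosidase sequence) that simply carries Arg-Arg at positions 13 and 14.

```python
# Dipeptides named in the text as recognizable by the processing enzyme:
# Pro-Arg, Arg-Arg, Lys-Arg (one-letter codes).
RECOGNITION_DIPEPTIDES = {"PR", "RR", "KR"}


def find_recognition_sites(seq: str) -> list:
    """Return (1-based position of the first residue, dipeptide) for each site found."""
    return [
        (i + 1, seq[i:i + 2])
        for i in range(len(seq) - 1)
        if seq[i:i + 2] in RECOGNITION_DIPEPTIDES
    ]


def substitute(seq: str, position: int, residue: str) -> str:
    """Replace the residue at a 1-based position with another residue."""
    return seq[:position - 1] + residue + seq[position:]


# Hypothetical toy sequence with Arg-Arg at positions 13-14 (illustration only).
TOY_PEPTIDE = "ASDFGHLMNQTVRRDWENSG"

sites_before = find_recognition_sites(TOY_PEPTIDE)
mutant = substitute(TOY_PEPTIDE, 14, "K")   # Arg at position 14 replaced by Lys
sites_after = find_recognition_sites(mutant)
```

In the toy sequence the scan reports the Arg-Arg pair starting at position 13, and the Arg-to-Lys substitution at position 14 leaves Arg-Lys, which is not among the listed recognition dipeptides.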

The linker amino acid for improving the solubility of the chimeric protein is preferably hydrophilic. Some target
peptides cause a change in charge of the chimeric protein, resulting in lowered productivity of the inclusion body. The
hydrophilic amino acid is preferably a histidine residue, which has a buffering property around physiological pH (neutral)
within the bacteria in which inclusion bodies are produced. Further, insertion of the sequence Pro-Gly is preferred
from the viewpoint of relaxing the secondary structure of the linker portion after denaturation and solubilization.
In order to cope with a variety of physiologically active peptides, it is most preferred that a synthetic DNA fragment
encoding (His)4-Pro-Gly be provided and properly inserted, in one or more copies, into a DNA fragment encoding a
protective peptide to design a linker of {(His)4-Pro-Gly}n wherein n = 1 to 6. The sequence of the N-terminal portion of
this linker will be hereinafter referred to as "LN."

Further, the present inventors have for the first time clarified that, when a chimeric protein is used as a substrate
for the Kex2 protease derivative (secretory Kex2 derivative) derived from Saccharomyces cerevisiae, which has become
mass-producible, the amino acids in the vicinity of the Kex2 protease recognition site (Lys/Arg/Pro)-Arg also greatly
affect the activity. Specifically, they have clarified that when the amino acid sequence located at the C-terminal of the
linker is X1-X2-(Lys/Arg/Pro)-Arg, wherein X1 is Arg, Lys, or His and
X2 is His or Phe, the activity of the secretory Kex2 derivative becomes maximum. The sequence of the C-terminal
portion of this linker will be hereinafter referred to as "LC."

The target peptide produced and excised by the chimeric protein method may be purified by various methods. In
this case, the use of the above chimeric protein has been found to permit the chimeric protein remaining unreacted,
the protective peptide, and impurities derived from E. coli to be efficiently removed by a simple precipitation procedure.
Specifically, it has been found that, after the reaction with the secretory Kex2 derivative, shifting of pH to the weakly
acidic side followed by dilution of the reaction solution to decrease the denaturant concentration enables not only most
of the impurities derived from E. coli but also not less than 95% of the chimeric protein remaining unreacted and the
protective peptides to be transferred into the precipitate, permitting not less than 95% of the target peptide to stay in
the supernatant and a clear liquid containing the target peptide to be simply obtained by centrifugation or press filtration.

After pH adjustment or the like, the supernatant was directly passed through a cation exchange chromatography column
to adsorb the target peptide, and the target peptide was then eluted by taking advantage of a change in salt concentration
or a change in pH. The eluate was passed through a low-pressure reversed phase chromatography column to adsorb the target
peptide, which was then eluted by taking advantage of a change in concentration of an organic solvent, followed by
concentration of the eluate. After lowering the concentration of the organic solvent, the eluate was subjected to reversed
phase HPLC, and separation and fractionation were performed by taking advantage of a change in concentration of the
organic solvent to give a high-purity target peptide.

The present invention will be described by taking, by way of example, a human parathyroid hormone derivative
hPTH (1-34) as a target peptide.

A βGal-210SPPH84 expression plasmid pG210ShProPTH was prepared by ligating synthetic DNA oligomers together
to prepare an hPTH (1-84) gene and substituting the gene thus prepared for the region of plasmid pG210ShCT[G]
encoding hCT[G] (Fig. 3). A plasmid pG117SHPPH34 was then prepared, wherein a translation stop codon was
substituted for the codon encoding the amino acid (Val) located at the 35-position from the N-terminal of the hPTH (1-84)
gene and a gene encoding βGal-117S (a peptide of amino acids 1-117 on the N-terminal side of E. coli β-galactosidase
with the cysteine residues being substituted with serine residues) was substituted for the gene coding for βGal-210S (a
peptide of amino acids 1-210 on the N-terminal side of E. coli β-galactosidase with the cysteine residues being
substituted with serine residues) (Figs. 6 and 7).

pG139SHPPH34 or pG97SHPPH34 can be prepared in the same manner as described above, except that a gene
encoding βGal-139S or a gene encoding βGal-97S is inserted instead of the gene encoding βGal-210S.

In these plasmids, structural genes respectively encoding βGal-97S, βGal-117S, and βGal-139S are connected
to a structural gene of hPTH (1-34) through a linker peptide Phe-Met-Lys-Ser-Val-Lys-Lys-Arg, and the structural
gene encoding each chimeric protein is under the control of a lac promoter. Further, each plasmid contains a
tetracycline-resistance gene marker.

βGal-97SPPH34 had low productivity although it had high solubility in a urea solution. On the other hand, for
βGal-117SPPH34 and βGal-139SPPH34, although the productivity was high, the solubility under conditions for a reaction
with the secretory Kex2 derivative was so low that the use of a larger amount of the secretory Kex2 derivative was
necessary for excising the hPTH (1-34) from the chimeric protein. For this reason, a linker LN, which is hydrophilic,
exhibits a buffering property around pH 7 (neutral) and comprises amino acids capable of inhibiting the formation of a
secondary structure of the chimeric protein, was inserted between the protective peptide and the linker
Met-Ser-Val-Lys-Lys-Arg, in order to provide a chimeric protein having high productivity and high solubility in a urea solution.

A peptide having the sequence {(His)4-Pro-Gly}n, wherein n = 1 to 6, was used as LN. {(His)4-Pro-Gly}n, wherein
n = 1 to 6, is introduced by synthesizing a DNA fragment encoding (His)4-Pro-Gly or {(His)4-Pro-Gly}2 and inserting
the DNA fragment into the 3'-terminal portion of a DNA fragment encoding the protective peptide in a suitable molar
ratio. Thus, a gene can be prepared which encodes a chimeric protein comprising a linker peptide containing one or
a plurality of the above amino acid sequences.

A gene with a DNA fragment inserted in the reverse orientation was identified by DNA sequencing and excluded. E. coli
strain M25 was transformed with plasmids encoding chimeric proteins to examine the productivity of the chimeric
proteins (Fig. 8). As a result, it was found that in the case of the protective peptides βGal-97S and βGal-139S, n = 3
offered high productivity of the chimeric protein, while in the case of the protective peptide βGal-117S, n = 1 to 6 offered
high productivity of the chimeric protein. Inclusion bodies of these chimeric proteins were washed and separated by
cell disruption and centrifugation, solubilized with urea, diluted, and used for a reaction with
Kex2-660 (Fig. 9).

As a result, it was found that the introduction of {(His)4-Pro-Gly}n, wherein n = 1 to 6, enhanced the sensitivity of
all the chimeric proteins to Kex2-660. Accordingly, for the chimeric protein for the production of hPTH (1-34),
βGal-117S4HPPH34 comprising βGal-117S as the protective peptide and (His)4-Pro-Gly as the linker peptide LN was adopted
by taking into consideration the productivity of the chimeric protein, the reactivity with Kex2-660, and the size of the
protective peptide.

The above results show that, in the design of a chimeric protein for excising a physiologically active peptide by a
secretory Kex2 derivative, the insertion of a peptide of {(His)4-Pro-Gly}n, wherein n = 1 to 6, is suitable for improving
not only the productivity in E. coli but also the solubility under conditions for a reaction with a secretory Kex2 derivative.

In the present process, the secretory Kex2 derivative should be used in an amount of about 1:1000 in terms of
the molar ratio to the chimeric protein and, hence, accounts for the major part of the cost of raw materials for the production.
Therefore, this has required a design of a chimeric protein which can be more easily cleaved by the secretory Kex2
derivative. The pro-sequence Lys-Ser-Val-Lys-Lys-Arg of a precursor of human parathyroid hormone was used for the
secretory Kex2 derivative recognition site of βGal-117S4HPPH34. However, whether or not this sequence is best suited
for cleaving by the secretory Kex2 derivative is unknown, and there is a possibility that, as with many other proteases,
optimization results in further increased cleaving efficiency. Since hPTH (1-34) is connected to the recognition site on
its C-terminal side, the amino acids on that side cannot be substituted, whereas the sequence on the N-terminal
side can be freely selected so far as no recognition site is newly created. Accordingly, an attempt was made to convert
the P3 site (X2) of the chimeric protein as a substrate to any amino acid. Studies have been made on 18 amino acid
sequences shown in Fig. 10. As a result, it was found that the introduction of His or Phe into X2 permits hPTH (1-34)
to be excised from the chimeric protein with improved efficiency.

Thereafter, studies have been made on the P4 site using 20 amino acid sequences shown in Fig. 11 with His,
which exhibited no significant side reaction, fixed at X2. As a result, it was found that the introduction of Arg,
Lys, or His into X1 permits hPTH (1-34) to be excised from the chimeric protein with higher efficiency. Arg-His-Lys-Arg,
which had offered the highest cleavage efficiency, was selected as LC.

Further, in the course of the present studies, it was found that the peptide bond on the C-terminal side of the
sequence Arg-Arg present at the 13- and 14-positions of βGal-117S is cleaved by a secretory Kex2 derivative and,
consequently, in the subsequent step of purification by precipitation with an acid, the resultant βGal (1-14) is recovered
in the supernatant fraction together with hPTH (1-34). Therefore, Lys was substituted for Arg14 to
inhibit the cleaving, thereby preventing the production of βGal (1-14).

Based on the above results, for the chimeric protein for the production of hPTH (1-34), βGal-117S
with Arg14 being substituted with Lys (βGal-117KS) was adopted as the protective peptide, and a peptide having an amino acid
sequence of Pro-His-His-His-His-Pro-Gly-Gly-Arg-Pro-Ser-Arg-His-Lys-Arg was adopted as a linker peptide.

The present inventors expressed the chimeric protein βGal-117KS4HRHPH34, thus designed, in E. coli. The
resultant inclusion body was washed, recovered, solubilized, and used for a reaction with a secretory Kex2 derivative,
followed by studies on a purification method. As a result, impurities derived from E. coli, the chimeric
protein remaining unreacted, and the protective peptide could be precipitated by utilizing a phenomenon wherein a shift
of pH from neutral to the weakly acidic side causes a marked lowering in solubility, that is, by shifting pH from 7.8 to
8.2, an optimal pH for a reaction with a secretory Kex2 derivative, to 6.2 to 6.5.

Further, decreasing the denaturant (urea) concentration by dilution enabled most of the impurities to be precipitated.
The precipitate was removed by centrifugation or press filtration to efficiently recover hPTH (1-34) in the
supernatant. The supernatant was adjusted to pH 5.0 and passed through a cation exchange chromatography column (for example,
PorosHS, SP Toyopearl) to adsorb hPTH (1-34), followed by elution with 0.3-0.4 M NaCl. Acetic acid was added to the
eluate, and the mixed solution was then passed through a low-pressure reversed phase chromatography column (for example,
PorosR2, SokenODS-W) to adsorb hPTH (1-34), followed by elution with about 30% acetonitrile to remove impurities
which adversely affect concentration and reversed phase HPLC.

The acetonitrile contained in the eluate was distilled off under reduced pressure, and the residue was filtered
through a 0.22 µm filter, passed through reversed phase HPLC (for example, Inertsil Prep-ODS, TSK-ODS-120T) to
adsorb hPTH (1-34), and fractionated by gradient elution of acetonitrile. A high-purity fraction was collected. Thus,
hPTH (1-34) having a purity of 99.5% was fractionated with a recovery of 40 to 70%. The above results show that this
recovery is the highest one among those reported in the art, substantiating the usefulness of the present invention.

Examples 

The present invention will be described in more detail with reference to the following examples, though it is not
limited to these examples only.

At the outset, the plasmids and E. coli strains used as materials for the present invention, and basic experimental procedures
common to each example, will be described.

Plasmid 


A plasmid pG210ShCT[G] is a plasmid which can express, under the control of a promoter of the E. coli lactose operon
(a lac promoter), a chimeric protein comprising a peptide of amino acids 1-210 on the N-terminal side of β-galactosidase
with Cys76, Cys122, and Cys154 being substituted by Ser, i.e., βGal-210S, and hCT[G] (a peptide of human calcitonin
with a glycine residue added to the C-terminal of the amino acid 32) bound to βGal-210S through a glutamic acid residue.
It was prepared by ligating a DNA fragment comprising a gene encoding a part of βGal-210S, the DNA fragment
being prepared by digesting pGHα210(Ser)rop- with restriction enzymes PvuII and EcoRI, with a DNA fragment
comprising a vector prepared by digesting pG97S4DhCT[G] with restriction enzymes PvuII and EcoRI (Fig. 13). A process
for preparing pGHα210(Ser)rop- is disclosed in Japanese Examined Patent Publication (Kokoku) No. 6-87788, and E.
coli strain W3110 containing the plasmid pG97S4DhCT[G] was designated as "Escherichia coli SBM 323," and
deposited as FERM BP-3503 with the Institute of Bioscience and Human Technology, Agency of Industrial Science and
Technology, Ministry of International Trade & Industry, on August 8, 1991.

Introduction of a DNA region, encoding a target peptide, as an EcoRI-SalI DNA fragment into plasmid pG210ShCT[G]
while matching the reading frame enables the expression of a chimeric protein with βGal-210S. The plasmid was
used for cloning of a synthetic human parathyroid hormone (hProPTH (1-84)) gene and as a material for the preparation
of plasmid pGP#19 (Fig. 3).

Escherichia coli (E. coli) 


E. coli strain JM109 was purchased as competent cells from Toyobo Co., Ltd. and used in the preparation of
plasmids. E. coli strain M25 (W3110/ompT: Sugimura et al., Biochem. Biophys. Res. Commun., 153, 753-759, 1988)
was utilized in the expression of a chimeric protein.

Medium

The following media were used in the culture of E. coli.

SB medium: 0.5% (w/v) glycerol, 2.4% (w/v) yeast extract, 1.2% (w/v) tryptone, 0.1 M potassium hydrogenphosphate
buffer (pH 7.5).

NU1 synthetic medium: 0.4% (w/v) yeast extract, 0.4% (w/v) K2HPO4, 0.4% (w/v) KH2PO4, 0.27% (w/v) Na2HPO4,
1.2% (w/v) (NH4)2SO4, 0.02% (w/v) NH4Cl, 0.3% (w/v) L-methionine, 0.2% (w/v) MgSO4·7H2O, 40 mg/L FeSO4·7H2O,
40 mg/L CaCl2·2H2O, 10 mg/L AlCl3·6H2O, 10 mg/L CoCl2·6H2O, 2 mg/L ZnSO4·7H2O, 2 mg/L Na2MoO4·2H2O,
1 mg/L CuCl2·2H2O, 0.5 mg/L H3BO3, and 10 mg/L MnSO4·nH2O.
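
As a worked illustration of the concentrations listed above, the following Python sketch converts % (w/v) values into grams for a chosen batch volume, using the relation that 1% (w/v) corresponds to 10 g/L. Only the three % (w/v) components of SB medium are included; the function names and the 2 L batch volume are assumptions made for the example and do not come from the specification.

```python
def percent_wv_to_g_per_l(percent: float) -> float:
    """1% (w/v) means 1 g per 100 mL, i.e. 10 g/L."""
    return percent * 10.0


def grams_needed(recipe: dict, volume_l: float) -> dict:
    """Mass of each component (g) for a batch of the given volume (L)."""
    return {
        name: round(percent_wv_to_g_per_l(percent) * volume_l, 3)
        for name, percent in recipe.items()
    }


# % (w/v) components of SB medium as listed in the text (buffer not included).
SB_MEDIUM = {"glycerol": 0.5, "yeast extract": 2.4, "tryptone": 1.2}

# Hypothetical 2 L batch, chosen only for illustration.
batch = grams_needed(SB_MEDIUM, 2.0)
```

For the assumed 2 L batch this gives 10 g of glycerol, 48 g of yeast extract, and 24 g of tryptone.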

Basic experimental procedures

In the examples, the following experimental procedure was used unless otherwise specified. 

DNA primers were synthesized with an automatic synthesizer (Model 320 A, manufactured by Applied Biosystems)
by the phosphoramidite method. The DNA sequence was determined by the dideoxy method.

DNA cleavage was performed by a reaction for one hr using a restriction enzyme in an amount 3 to 10 times that
indicated by the supplier. The structure of the plasmid was analyzed using 0.5 to 1 µg of DNA in 20 µl of a reaction solution,
and the preparation of DNA was performed using 3 to 10 µg of DNA in 50 to 100 µl of a reaction solution. The reaction
temperature, the reaction buffer and other conditions were as indicated by the supplier.

A sample for electrophoresis on agarose gel was prepared by adding 0.2 volume of a dye (a 15% (w/v) aqueous
Ficoll solution containing 0.25% (w/v) bromophenol blue) to the reaction solution. TAE buffer (40 mM Tris/acetic acid,
2 mM EDTA) was used as a buffer for electrophoresis on agarose gel. For the structural analysis of a plasmid,
electrophoresis at 100 V for one hr was performed using Mupid-2 (manufactured by Cosmo-Bio Co., Ltd.), and for the
preparation of DNA, electrophoresis was performed on a horizontal gel (20 cm x 15 cm x 0.7 cm) at 150 V for 4 hr or at
35 V for 13 hr. The gel was stained with an aqueous ethidium bromide solution (0.5 µg/ml) for 20 min and irradiated with
ultraviolet light to detect a DNA band. The agarose gel concentration was 1.0% or 2.0% (w/v) depending upon the size
of the DNA fragment to be fractionated.

DNA was extracted from the agarose gel by placing the gel in a dialysis tube filled with 0.1 x
TAE buffer and applying a voltage to conduct elution. The DNA solution was treated with phenol and chloroform and
subjected to precipitation with ethanol to purify DNA.

Ligation was performed by adding 10 units of T4 DNA ligase to 30 µl of a reaction solution (67 mM Tris/HCl (pH
7.5), 5 mM MgCl2, 5 mM DTT, 1 mM ATP) containing 0.05 to 1 µg of a DNA fragment and allowing a reaction to proceed
at 16°C for 12 to 18 hr or by using TAKARA Ligation Kit (manufactured by Takara Shuzo Co., Ltd.).

E. coli strain JM109 was transformed as indicated by the supplier, E. coli strain M25 was transformed by the calcium
chloride method, and the transformed strain was selected by taking advantage of drug resistance (tetracycline).

An SDS-polyacrylamide gel electrophoresis (SDS-PAGE) sample was prepared by adding an equal volume of
SDS sample buffer (63 mM Tris/HCl (pH 6.8), 10% (w/v) glycerol, 10% (w/v) SDS, and 10 µg/ml bromophenol blue)
to the sample and boiling the mixture for 3 min. The electrophoresis was performed on 16% (w/v) SDS-PAGE mini
(TEFCO) gel at 20 mA per piece of gel for 70 min, and the gel was stained with Coomassie Brilliant Blue G.

hPTH (1-34) was quantitatively determined as follows. A solution containing hPTH (1-34) was subjected to 10- to
20-fold dilution with a sample buffer (1 M acetate/2 M urea), and the diluted solution was centrifuged to obtain a
supernatant. The supernatant was applied to HPLC (LC10A, manufactured by Shimadzu Seisakusho Ltd.) to which a
YMC-ODS-A302 (4.6 mm I.D. x 150 mm) column (manufactured by K.K. Yamamura Kagaku Kenkyujo) had been connected,
and development was conducted using liquid A (0.1% (v/v) trifluoroacetic acid (TFA)/10% (v/v) acetonitrile) and
liquid B (0.094% (v/v) TFA/50% (v/v) acetonitrile) with gradient B 40% -> 80%/20 min. The area of the hPTH (1-34)
peak, detected by determination of absorbance at 214 nm, was compared with the peak area of standard hPTH (1-34)
having a known concentration.
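The peak-area comparison described above can be sketched as a single-point external-standard calculation. This is only an illustrative sketch: the function name, the numeric values, and the assumptions of a linear detector response and equal injection volumes for sample and standard are not taken from the specification.

```python
def quantify_by_peak_area(sample_area, standard_area, standard_conc, dilution_factor):
    """Estimate the analyte concentration in the undiluted sample.

    Assumes (hypothetically) a linear detector response and identical
    injection volumes for the diluted sample and the standard.
    The result has the same units as standard_conc.
    """
    if standard_area <= 0:
        raise ValueError("standard peak area must be positive")
    # Concentration in the diluted sample, by proportion of peak areas.
    diluted_conc = standard_conc * sample_area / standard_area
    # Correct for the 10- to 20-fold dilution applied before injection.
    return diluted_conc * dilution_factor


# Hypothetical numbers: a 20-fold diluted sample gives half the standard's area.
estimate = quantify_by_peak_area(
    sample_area=1500.0, standard_area=3000.0, standard_conc=0.1, dilution_factor=20
)
print(f"estimated concentration: {estimate:.3f} (units of the standard)")
```

The dilution factor is kept as an explicit argument because the procedure allows anything from 10- to 20-fold dilution.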

Other basic gene manipulation was performed according to a method described in Molecular Cloning (Maniatis, 
et al., Cold Spring Harbor Laboratory, New York (1982)) unless otherwise specified. 


Example 1 Preparation of hProPTH (1-84) gene 

As shown in Fig. 1, hProPTH (1-84) gene was synthesized as 14 divided fragments of U1 to U7 (SEQ ID NOS: 1 
to 7) and L1 to L7 (SEQ ID NOS: 8 to 14). 
These fragments were ligated as described below to prepare hProPTH (1-84) gene (Fig. 2).

At the outset, about 1 µg of DNA fragment U1 (SEQ ID NO: 1) and about 1 µg of DNA fragment L7 (SEQ ID NO:
14) were reacted in 15 µl of a phosphorylation solution (50 mM Tris/HCl (pH 7.6), 10 mM MgCl2, 5 mM DTT) containing
16 units of T4 polynucleotide kinase and 0.5 nM (1 MBq or more) [γ-32P]dATP at 37°C for 15 min. 5 µl of a
phosphorylation solution containing 5 mM ATP was added thereto, and a reaction was allowed to proceed at 37°C for 45 min.
The same procedure was repeated for a combination of U2 (SEQ ID NO: 2) with L6 (SEQ ID NO: 13), a combination
of U3 (SEQ ID NO: 3) with L5 (SEQ ID NO: 12), a combination of U4 (SEQ ID NO: 4) with L4 (SEQ ID NO: 11), a
combination of U5 (SEQ ID NO: 5) with L3 (SEQ ID NO: 10), a combination of U6 (SEQ ID NO: 6) with L2 (SEQ ID
NO: 9), and a combination of U7 (SEQ ID NO: 7) with L1 (SEQ ID NO: 8).
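The pairing scheme used for the seven phosphorylation reactions follows a simple rule: fragment Ui is combined with fragment L(8-i). The sketch below enumerates the pairs together with their SEQ ID numbers as listed above. The function name and the dictionary keys are hypothetical.

```python
def kinase_pairs(n=7):
    """List the (upper, lower) fragment pairs phosphorylated together.

    U_i is paired with L_(n+1-i). SEQ ID NOS 1..n are the U fragments
    and n+1..2n are the L fragments, as in the text.
    """
    pairs = []
    for i in range(1, n + 1):
        j = n + 1 - i  # index of the lower-strand fragment
        pairs.append({
            "upper": f"U{i}", "upper_seq_id": i,
            "lower": f"L{j}", "lower_seq_id": n + j,
        })
    return pairs


for p in kinase_pairs():
    print(f'{p["upper"]} (SEQ ID NO: {p["upper_seq_id"]}) with '
          f'{p["lower"]} (SEQ ID NO: {p["lower_seq_id"]})')
```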

The above 7 reaction solutions were combined, and the resultant mixture was subjected to precipitation with ethanol
to recover DNA which was then dissolved in 80 µl of 100 mM Tris/HCl (pH 7.6), 6.5 mM MgCl2, and 300 mM NaCl. A
40 µl aliquot was allowed to stand at 95°C for 5 min, and the temperature was then decreased to 43°C over a period
of 30 min. It was then ice cooled, 40 µl of ligation liquid B (manufactured by Takara Shuzo Co., Ltd.) was added thereto,
and the mixture was allowed to stand at 26°C for 15 min.

This sample was electrophoresed on 5% polyacrylamide. After the electrophoresis, a ligated DNA fragment was
detected by autoradiography. A DNA fragment corresponding to about 280 bp was extracted from the gel and then
purified by a conventional method.

Example 2 Preparation of βGal-139S(FM)PPH84 expression plasmid pGP#19

The about 280 bp DNA fragment containing the synthetic hProPTH (1-84) gene has a restriction enzyme EcoRI site at
the 5'-terminal and a restriction enzyme SalI site at the 3'-terminal. Cloning of the hProPTH (1-84) gene was performed
by inserting the EcoRI/SalI DNA fragment into the EcoRI/SalI site of pG210ShCT[G].

After pG210ShCT[G] was digested with restriction enzymes EcoRI and SalI, an about 3.5 kb DNA fragment
containing the vector portion was prepared. This fragment was ligated with the about 280 bp DNA fragment of hProPTH
(1-84) prepared in Example 1 to prepare pG210ShProPTH (Fig. 3). pG210ShProPTH was transformed into E. coli
strain JM109 to prepare JM109 [pG210ShProPTH].

Further, linkers KM091 (SEQ ID NO: 15) and KM092 (SEQ ID NO: 16) were inserted into the restriction enzyme
XhoI/EcoRI site of pG210ShProPTH to prepare plasmid pG210S (S/X) (Fig. 3). The above linker has restriction enzyme
XhoI and EcoRI sites at its respective ends and a SacI site therebetween.

pG210S (S/X) was digested with restriction enzymes SacI and XhoI, and a DNA region encoding βGal-210S was
specifically deleted using Deletion Kit for Kilo-Sequence (manufactured by Takara Shuzo Co., Ltd.). Ends were repaired
using a Klenow fragment, followed by self-ligation to prepare plasmid pGP#19 encoding a chimeric protein βGal-139S
(FM)PPH84 with βGal-139S and hProPTH (1-84) bound thereto through Phe-Met (Fig. 4). E. coli strain JM109 with
pGP#19 introduced thereinto was designated as "JM109 [pGP#19]."


Example 3 Preparation of βGal-117SmHPPH34 expression plasmid pG117SmHPPH34

In this example, the preparation of βGal-117SmHPPH34 expression plasmid pG117SmHPPH34 will be described
by way of example, and βGal-97SmHPPH34 expression plasmid pG97SmHPPH34 or βGal-139SmHPPH34
expression plasmid pG139SmHPPH34 can be prepared using βGal-97S or βGal-139S instead of βGal-117S.

1) Preparation of pGP#19PPH34 

pGP#19PPH34 was prepared as follows. A DNA fragment of hPTH (1-84) in which codon GTT of Val at position 35
had been converted to the translation stop codon TAA was amplified by PCR using pGP#19 as a template and S01
(SEQ ID NO: 17) and S02 (SEQ ID NO: 18) as primers. A restriction enzyme AatII-SalI DNA fragment was isolated and
purified by a conventional method, followed by exchange with the corresponding portion of pGP#19 to prepare
pGP#19PPH34 (Fig. 5).

2) Preparation of pG117SPPH34

Amplification was performed by PCR using pG210S (S/X) as a template and S03 (SEQ ID NO: 19) and S05 (SEQ
ID NO: 20) as primers, followed by digestion with restriction enzymes PvuI and SmaI to prepare a DNA fragment.
Further, amplification was performed by PCR using pGP#19PPH34 as a template and S07 (SEQ ID NO: 21) and S02
as primers, followed by digestion with restriction enzymes SalI and SmaI to prepare another DNA fragment. The above
two DNA fragments and a restriction enzyme PvuI-SalI DNA fragment containing a replication origin of pGP#19PPH34
were ligated together using T4 ligase to prepare pG117SPPH34 (Figs. 6 and 7).

3) Preparation of pG117SmHPPH34 (m = 4 or 8)

pG117SmHPPH34 (m = 4 or 8) was prepared by inserting linkers S08 (SEQ ID NO: 22) and S09 (SEQ ID NO: 23)
encoding (His)4-Pro-Gly and linkers S10 (SEQ ID NO: 24) and S11 (SEQ ID NO: 25) encoding {(His)4-Pro-Gly}2 into
the restriction enzyme SmaI site of pG117SPPH34. In this connection, it should be noted that a plasmid (m = 12, 16
or 24) can be prepared by varying the length of the linker (Figs. 6 and 7).
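The relation between the repeat number n of the {(His)4-Pro-Gly}n linker and the value m in the plasmid names (m = 4n histidine residues) can be sketched as follows. The function names are hypothetical and the output is only a three-letter-code rendering of the repeat, not a sequence taken from the sequence listing.

```python
def his_linker(n):
    """Amino acid sequence of {(His)4-Pro-Gly}n in three-letter code."""
    unit = ["His"] * 4 + ["Pro", "Gly"]
    return "-".join(unit * n)


def histidine_count(n):
    """Number of His residues (the 'm' in pG117SmHPPH34) for n repeats."""
    return 4 * n


for n in (1, 2, 3, 4, 6):
    print(f"n = {n}: m = {histidine_count(n)}, linker = {his_linker(n)}")
```

With n = 1, 2, 3, 4, and 6 this gives m = 4, 8, 12, 16, and 24, the values of m named in the text.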

Further, pG117S4KPPH34 was prepared by inserting linkers S12 (SEQ ID NO: 26) and S13 (SEQ ID NO: 27)
encoding (Lys)4-Pro-Gly into the restriction enzyme SmaI site of pG117SPPH34. Similarly, pG97S4KPPH34 and
pG139S4KPPH34 can be prepared.

The number and direction of inserted linkers were confirmed by determining the DNA sequence after the
preparation of plasmids, and chimeric proteins prepared in this example are tabulated in Table 1.



Table 1

            βGal-97SmHPPH34    βGal-117SmHPPH34    βGal-139SmHPPH34
    0              O                   O                   O
    4H             O                   O
    8H             O                   O                   O
    12H            O                   O                   O
    16H                                O
    24H                                O
    4K             O                   O                   O


In the table, H represents a histidine residue, K represents a lysine residue, and the numeral attached to H or K 
represents the number of amino acids. 

Example 4 Production of chimeric protein hPTH (1-34)

The productivity of the chimeric proteins listed in Table 1 was examined using SDS-PAGE.

16 plasmids, each containing a gene encoding a chimeric protein listed in Table 1, were each introduced into E. coli
strain JM109. Transformed cells were cultured in a test tube containing 2 ml of SB medium at 37°C with shaking for
16 hours, followed by dilution with physiological saline to a culture turbidity (OD660) of 10. An equal amount of an SDS
sample buffer was added to 100 µl of the suspension, and the mixture was boiled for 3 min. 10 µl of the supernatant was
subjected to SDS-PAGE to compare the amount of the chimeric protein produced per bacterium.

The results are shown in Fig. 8. As is apparent from Fig. 8, the amount of expressed chimeric protein varied
depending upon the combination of β-galactosidase derivative and number of histidine residues. Specifically, for βGal-97S,
high productivity could be attained in the case of {(His)4-Pro-Gly}n wherein n = 2 and 3; for βGal-117S, high productivity
could be attained in the case of {(His)4-Pro-Gly}n wherein n = 1, 2, 3, 4, and 6; and for βGal-139S, high productivity
could be attained in the case of {(His)4-Pro-Gly}n wherein n = 2 and 3. In particular, for the derivatives of βGal-117S,
the introduction of {(His)4-Pro-Gly}n, wherein n = 1 to 6, resulted in a markedly increased amount of expressed chimeric
protein, and, in this case, the amount of the expressed chimeric protein was larger than that in the case of a mere
change in charge by the lysine residue.

Example 5 Cleavage of hPTH (1-34) from chimeric protein

For the highly productive chimeric proteins (βGal-97SmHPPH34 (m = 12), βGal-117SmHPPH34 (m = 0, 4, 8, 12,
16, and 24), and βGal-139SmHPPH34 (m = 0 and 12), with m = 0 being the control), the suitability as a substrate for
Kex2-660 was investigated as follows. The chimeric protein inclusion body was recovered, dissolved in 8 M urea, and
diluted to give a reaction composition (20 mM Tris/HCl (pH 8.2), 100 mM NaCl, 3.0 M urea, 2.0 mM CaCl2, 8 mg/ml
chimeric protein). Kex2-660 was then added to a final concentration of 20 kU/ml to cleave hPTH (1-34). 90% or more of
βGal-117SPPH34 as the control was cleaved under these conditions in one hr.

In Fig. 9, the reactivity of each derivative with Kex2-660 is expressed in terms of the ratio of its kcat to that of
βGal-117SPPH34. The quantitative determination of hPTH (1-34) was carried out according to the method described in
the above "Basic experimental procedures." As a result, it was found that, for all the cases, the introduction of
{(His)4-Pro-Gly}n, wherein n = 1 to 6, further results in improved reactivity with Kex2-660.

In particular, for βGal-117S4HPPH34, it was found that the insertion of one (His)4-Pro-Gly linker results in improved
production of the chimeric protein and improved reactivity with Kex2-660 and, in addition, offers a solubility of not less
than 15 g/L under Kex2 reaction conditions, i.e., contributes to an improvement in solubility over the solubility of
βGal-139SPPH34 (8 g/L).

Example 6 Mass production of hPTH(1-34) 

In order to mass-produce the chimeric protein βGal-117S4HPPH34, which had been found to be expressed with
a high productivity and to be suitable as a substrate for Kex2-660, the plasmid pG117S4HPPH34 containing a gene
encoding this chimeric protein was transformed into E. coli strain M25 (W3110/ompT) to prepare strain M25
[pG117S4HPPH34]. The strain M25[pG117S4HPPH34] was cultured using a medium (2% (w/v) yeast extract, 1% (w/v)
tryptone, 1.2% (w/v) K2HPO4, 0.3% (w/v) KH2PO4 and 0.5% (w/v) glycerol) contained in a 20 L culture tank at 37°C.

When the cell concentration reached OD660 = 1.0, IPTG (isopropyl β-thiogalactoside) was added to a final
concentration of 1 mM, followed by further culture until the cell concentration reached OD660 = 12. The cells were
harvested and suspended in TE (10 mM Tris/HCl, 1 mM EDTA (pH 8.0)). The suspension was subjected to high pressure
homogenization (Manton-Gaulin) to disrupt the cells. The homogenate was centrifuged to obtain a precipitate containing
the chimeric protein, which was then washed with TE and deionized water. About 100 g of inclusion bodies containing
the chimeric protein were obtained from the 20 L culture. To 250 ml of the suspension of the inclusion bodies (160 g/L)
were added 100 ml of 1 M Tris/HCl (pH 8.2), 50 ml of 5 M NaCl, 500 ml of deionized water, and 900 g of urea, and the
mixture was stirred for 30 min in a thermostatic chamber kept at 30°C to dissolve the components, diluted with warmed
deionized water, and brought to 5 L at 30°C.

50 ml of a 250 mM CaCl2 solution was slowly added while stirring, and Kex2-660 was added thereto to 20 kU/ml.
2 hr after the addition of Kex2-660 (percentage cleavage: not less than 90%), pH was adjusted to 6.4 by the addition
of acetic acid, followed by two-fold dilution with deionized water to coagulate and sediment the chimeric protein
remaining unreacted and the protective peptide. Centrifugation was performed to obtain a supernatant containing hPTH
(1-34). The supernatant was adjusted to pH 5.0 by the addition of acetic acid and passed through a cation exchange
resin (SP Toyopearl) column equilibrated with a 10 mM sodium acetate buffer (pH 5.0) to adsorb hPTH (1-34), followed
by elution of hPTH (1-34) with 0.4 M NaCl.
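As a rough check on the solubilization step of this example, the final concentrations in the 5 L reaction mixture can be estimated from the stock volumes given above. The sketch below is an approximation only: it ignores the extra volume contributed by the 50 ml CaCl2 addition and by the enzyme, assumes a molar mass of 60.06 g/mol for urea, and uses hypothetical function names.

```python
UREA_MOLAR_MASS = 60.06  # g/mol, assumed value for urea


def diluted_mM(stock_molar, stock_volume_ml, final_volume_l):
    """Final concentration in mM after diluting a stock to final_volume_l."""
    millimoles = stock_molar * stock_volume_ml  # mol/L x ml = mmol
    return millimoles / final_volume_l


def reaction_composition(final_volume_l=5.0):
    """Approximate composition of the Example 6 reaction mixture."""
    return {
        "Tris/HCl (mM)": diluted_mM(1.0, 100.0, final_volume_l),
        "NaCl (mM)": diluted_mM(5.0, 50.0, final_volume_l),
        "CaCl2 (mM)": diluted_mM(0.25, 50.0, final_volume_l),
        "urea (M)": 900.0 / UREA_MOLAR_MASS / final_volume_l,
        # 250 ml of a 160 g/L suspension = 40 g of inclusion bodies
        "chimeric protein (g/L)": 0.250 * 160.0 / final_volume_l,
    }


for name, value in reaction_composition().items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the mixture works out to about 20 mM Tris/HCl, 50 mM NaCl, 2.5 mM CaCl2, 3.0 M urea, and 8 g/L chimeric protein.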

Acetic acid was added to a final concentration of 3%, and the mixture was passed through a low-pressure reversed
phase column (Soken ODS-W) equilibrated with 3% (v/v) acetic acid to adsorb hPTH (1-34), followed by elution of
hPTH (1-34) with a 30% (v/v) acetonitrile solution containing 3% (v/v) acetic acid. Acetonitrile was removed under
reduced pressure, and the residue was filtered through a 0.22-µm membrane filter and purified on a reversed phase
HPLC preparative column (TSKgel ODS120T, ø55 x 600). In this case, the elution was done using acetonitrile with a
linear density gradient (A: 5% (v/v) acetic acid/10% (v/v) acetonitrile; B: 5% (v/v) acetic acid/40% (v/v) acetonitrile; %B
= 20% -> 80%/60 min) at a flow rate of 40 ml/min. The recovery of hPTH (1-34) for each step is tabulated in Table 2.



Table 2

Recovery of hPTH (1-34)

    Steps                                                Amount of PTH (g)    Recovery (%)
    Reaction with Kex2-660                                      7.0               100
    Supernatant obtained upon precipitation with acid           6.7                95
    Cation exchange column                                      6.0                85
    Low-pressure reversed phase column                          5.8                82
    HPLC                                                        4.0                57


Example 7 Substitution of amino acids at Kex2-660 recognition site P3 of chimeric protein 



In order to convert the amino acid at the P3 site (Lys) to different amino acids, oligonucleotides S14-S18 (SEQ ID
NOS: 28 to 32) with a codon corresponding to the P3 site being converted were synthesized, and a DNA fragment was
synthesized by PCR using these oligonucleotides and S02 (SEQ ID NO: 18) as primers and pG117S4HPPH34 as a
template. The DNA fragment thus obtained was purified by ethanol precipitation, cleaved by StuI and SalI, and
electrophoresed on 2.0% (w/v) agarose gel to purify a target 0.4 kbp StuI-SalI fragment.

This DNA fragment was ligated with a DNA fragment, encoding a β-galactosidase portion of pG117S4HPPH34,
which had been prepared by digestion with StuI and SalI and purification in the same manner as described
just above in connection with the target StuI-SalI DNA fragment. After the completion of the ligation reaction, the ligation
mixture was used to transform E. coli strain JM109. For each combination, 10 tetracycline resistant clones were
obtained, and plasmids from the clones were analyzed by DNA sequencing to obtain desired expression vectors.
That is, there was prepared an expression vector pG117S4HVXPH34 for a chimeric protein βGal-117S4HVXPH34
having an amino acid sequence wherein the Kex2-660 recognition site Ser-Val-Lys-Lys-Arg of the chimeric protein
encoded by pG117S4HPPH34 has been substituted with Ser-Val-X-Lys-Arg wherein X = Gly, Ala, Val, Leu, Ile, Ser, Thr,
Asp, Glu, Asn, Gln, Lys, Arg, Phe, Tyr, Trp, His, or Pro.
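The set of P3 variants described above can be enumerated programmatically, which is convenient for labelling clones or assay wells. This is only an illustrative sketch: the list of residues is copied from the text, while the function name and the output format are hypothetical.

```python
# Residues listed in the text for X in Ser-Val-X-Lys-Arg (Lys is the parent).
P3_RESIDUES = [
    "Gly", "Ala", "Val", "Leu", "Ile", "Ser", "Thr", "Asp", "Glu",
    "Asn", "Gln", "Lys", "Arg", "Phe", "Tyr", "Trp", "His", "Pro",
]


def p3_variant_sites(residues):
    """Map each P3 residue X to the recognition site Ser-Val-X-Lys-Arg."""
    return {x: "-".join(["Ser", "Val", x, "Lys", "Arg"]) for x in residues}


sites = p3_variant_sites(P3_RESIDUES)
for residue, site in sites.items():
    note = " (parent sequence)" if residue == "Lys" else ""
    print(f"X = {residue}: {site}{note}")
```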

In order to investigate the reactivity of each chimeric protein with Kex2-660, each clone was cultured in 50 ml of
an SB medium, and, when the cell concentration reached OD660 = 1.0, expression was induced by IPTG, followed by
culture for an additional 4 hr. Thereafter, cells were harvested, washed with a TE buffer, suspended, disrupted, and
centrifuged to prepare inclusion bodies. Inclusion bodies were washed with 1% (w/v) Triton X-100 containing 10 mM EDTA,
and TE to prepare purified inclusion bodies. 90% or more of the peptide component contained in the inclusion body
was a target chimeric protein. 20 µl of 1 M Tris/HCl (pH 8.2), 20 µl of 5 M NaCl, and 300 µl of 10 M urea were added
to a suspension of each chimeric protein inclusion body to dissolve it, the mixtures were then diluted with 650 µl
of deionized water and held in a thermostatic chamber at 30°C for 10 min, 10 µl of a 250 mM CaCl2 solution was added
thereto, and Kex2-660 was added to a concentration of 20 kU/ml.

90% or more of βGal-117SPPH34 as the control was cleaved under these conditions in one hr. hPTH (1-34)
produced was quantitatively determined by HPLC using the above ODS column. The reactivity of each derivative with
Kex2-660 was expressed in terms of the ratio of its kcat to that of βGal-117S4HPPH34 (Fig. 10). It was found that, when
X = Phe or His, the cleavage efficiency of hPTH (1-34) by Kex2-660 is improved, whereas when X = Pro, substantially
no hPTH (1-34) is excised by Kex2-660.

Example 8 Substitution of amino acids at Kex2-660 recognition site P4 of chimeric protein 

Based on the results of Example 7 and the fact that the amount of the by-product at the time of cleavage is small,
a chimeric protein having a histidine residue at the P3 site, βGal-117S4HVHPH34, was selected as a chimeric protein
for the production of hPTH (1-34), and an attempt was made to substitute the amino acid at the P4 site to further
improve the reactivity with Kex2-660. Oligonucleotides S19 to S24 (SEQ ID NOS: 33-38) were synthesized, and a DNA
fragment with mutation being introduced thereinto was synthesized by PCR using the above oligonucleotides and S02
as primers and pG117S4HVHPH34 as a template.

The procedure of Example 7 was repeated to prepare an expression vector pG117S4HXHPH34 for a chimeric
protein βGal-117S4HXHPH34 having a structure wherein the Kex2-660 recognition site Ser-Val-His-Lys-Arg of the
chimeric protein has been substituted with Ser-X-His-Lys-Arg wherein X = Gly, Ala, Leu, Ile, Ser, Thr, Asp, Glu, Asn,
Gln, Lys, Arg, Cys, Met, Phe, Tyr, Trp, His, Pro, or Val.

The sensitivity to Kex2-660 was judged using the chimeric proteins as the substrate under the same conditions
as used in Example 6. As a result, it was found that when X = Lys or Arg, the efficiency of the cleavage of hPTH (1-34)
by Kex2-660 is improved, whereas when X is an acidic amino acid (Asp or Glu), substantially no hPTH (1-34) is excised
by Kex2-660 (Fig. 11). Thus, it has for the first time become apparent that Kex2-660 protease requires an arginine
residue at the P1 site and a strongly basic amino acid or a proline residue at the P2 site and, at the same time, prefers
a histidine residue or a phenylalanine residue at the P3 site and a strongly basic amino acid at the P4 site, and that the
presence of a proline residue at the P3 site or the presence of an acidic amino acid at the P4 site results in remarkably
lowered chimeric protein cleaving activity.
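The site preferences summarized above can be written down as a small rule-based check of a P4-P3-P2-P1 sequence. This is a qualitative sketch only: it treats "strongly basic amino acid" as Lys or Arg, the category labels are hypothetical, and it does not predict kcat values or anything beyond the statements in this example.

```python
BASIC = {"Lys", "Arg"}   # assumed meaning of "strongly basic amino acid"
ACIDIC = {"Asp", "Glu"}


def assess_kex2_site(p4, p3, p2, p1):
    """Classify a P4-P3-P2-P1 site according to the stated Kex2-660 preferences."""
    # Requirements: Arg at P1; Lys, Arg, or Pro at P2.
    if p1 != "Arg" or p2 not in BASIC | {"Pro"}:
        return "requirements not met"
    # Pro at P3 or an acidic residue at P4 markedly lowers cleavage.
    if p3 == "Pro" or p4 in ACIDIC:
        return "markedly lowered cleavage"
    # His or Phe at P3 together with a basic residue at P4 is preferred.
    if p3 in {"His", "Phe"} and p4 in BASIC:
        return "preferred"
    return "acceptable"


print(assess_kex2_site("Arg", "His", "Lys", "Arg"))  # site of the optimized chimera
print(assess_kex2_site("Val", "Lys", "Lys", "Arg"))  # parent Ser-Val-Lys-Lys-Arg site
print(assess_kex2_site("Val", "Pro", "Lys", "Arg"))
```

The order of the checks matters: an unfavourable residue at P3 or P4 takes precedence over a favourable one at the other position, which is a design choice of this sketch, not a finding reported in the text.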

Example 9 Removal of Kex2-660-cleaved site of protective peptide 

The Kex2-660 cleaving reactions in Examples 7 and 8 were traced with the elapse of time by HPLC. As a result,
it was found that a peptide bond on the C-terminal side of the sequence Arg-Arg at the 13- and 14-positions of the
protective peptide βGal-117S was cleaved. Therefore, it was expected that the resultant βGal (1-14) would be carried
over at the time of purification of hPTH (1-34) by precipitation with an acid. Accordingly, an attempt was made to
substitute Arg at position 14 with Lys to inhibit cleaving by Kex2-660, thereby inhibiting the production of βGal (1-14).

A plasmid pGKSPH34 (Fig. 12) encoding a chimeric protein (βGal-117KS4HRHPH34) with Arg at position 14 being
substituted with Lys was prepared as follows. At the outset, a DNA fragment with mutation being introduced thereinto,
AseI-RK, was synthesized by PCR using oligonucleotides S25 (SEQ ID NO: 39) and S26 (SEQ ID NO: 40) as primers and
pG117S4HRHPH34 as a template. Similarly, a DNA fragment, RK-BglII, was prepared using oligonucleotides S27
(SEQ ID NO: 41) and S28 (SEQ ID NO: 42) as primers.

Each DNA fragment was purified, about 1 ng of each of the purified DNA fragments was taken, the fragments
were mixed together, and PCR was performed for four cycles in the absence of primers to synthesize an AseI-BglII
fragment with mutation being introduced thereinto. The oligonucleotides S25 and S28 were added, and PCR was
further continued to amplify the fragment. The fragment was purified by ethanol precipitation, cleaved by AseI and
BglII, and electrophoresed on a 2.0% (w/v) agarose gel to prepare a target 0.35 kbp fragment. An AseI/SphI (1.8 kbp)
fragment and a SphI/BglII (1.4 kbp) fragment of pG117S4HRHPH34 were electrophoresed on a 1.0% (w/v) agarose
gel to purify these fragments.

The above three fragments were ligated together, and the ligation mixture was used to transform E. coli strain
JM109. Tetracycline resistant clones were obtained, and plasmids prepared by the alkali-lysis method were analyzed by
DNA sequencing. A target clone was selected which had a plasmid pGKSPH34 containing a gene wherein a DNA
sequence CGG encoding Arg at position 14 of the protective peptide has been substituted with AAG. Inclusion bodies of
chimeric proteins were obtained in the same manner as in Example 5. They were reacted with Kex2-660. As a result, it was
found that the above substitution substantially completely inhibited cleaving of the protective peptide, lightening the
burden imposed on the purification process.
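The codon change described above (CGG to AAG) can be checked against the standard genetic code. The sketch below uses only the arginine and lysine entries of the standard codon table. The helper names are hypothetical.

```python
# Arginine and lysine entries of the standard genetic code (DNA codons).
CODON_TO_AMINO_ACID = {
    "CGT": "Arg", "CGC": "Arg", "CGA": "Arg", "CGG": "Arg",
    "AGA": "Arg", "AGG": "Arg",
    "AAA": "Lys", "AAG": "Lys",
}


def base_changes(original, mutant):
    """Return (position, old_base, new_base) for each differing base (1-indexed)."""
    if len(original) != len(mutant):
        raise ValueError("codons must have the same length")
    return [(i + 1, a, b)
            for i, (a, b) in enumerate(zip(original, mutant)) if a != b]


original, mutant = "CGG", "AAG"
print(f"{original} ({CODON_TO_AMINO_ACID[original]}) -> "
      f"{mutant} ({CODON_TO_AMINO_ACID[mutant]})")
print("changed bases:", base_changes(original, mutant))
```

The substitution changes the first two bases of the codon and leaves the third unchanged.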

Example 10 Mass-production of chimeric protein βGal-117KS4HRHPH34 and production of hPTH (1-34)

In this example, the chimeric protein βGal-117KS4HRHPH34, which had been considered on a test tube level to be
best suited as a substrate for Kex2-660, was produced under large scale culture conditions, and the cleavage by Kex2-660
protease and the purification of hPTH (1-34) were performed.

E. coli strain M25 was transformed with pGKSPH34, and tetracycline-resistant strains were selected to prepare
a production strain M25[pGKSPH34]. The strain M25[pGKSPH34] was cultured
on a large scale in an NU1 synthetic medium containing 2% (w/v) glucose.

After the consumption of glucose, glycerol was fed as a carbon source, and culture was performed for 24 hr while
regulating pH to 7.0. The final cell concentration was 150 to 200 OD. This chimeric protein was stably produced under
large scale culture conditions, and no significant inhibition of growth of the host E. coli was observed despite the high
level expression. The amount of the inclusion body produced was about 5 g/L. After disruption of the cells, insolubles
were washed with TE and deionized water to prepare a purified inclusion body.

The inclusion body was solubilized with urea and diluted to give a reaction solution composition of 20 mM Tris/
HCl (pH 8.2), 50 mM NaCl, 2.5 mM CaCl2, 2.7 M urea, and 8.0 mg/ml chimeric protein. At a reaction temperature of
30°C, Kex2-660 was added to a concentration of 5 kU/ml to cleave hPTH (1-34) from the chimeric protein, and the
production of hPTH (1-34) was quantitatively determined. As a result, 30 min after the initiation of the reaction, the
percentage cleavage reached 90% or more, and the amount of Kex2-660 was decreased to 1/4 as compared with
βGal-117S4HPPH34. After the completion of the reaction, hPTH (1-34) was purified in the same manner as in Example
5. The recovery in the purification process was substantially the same as that in Example 5.

Example 11 Preparation of βGal-117S4HGP production plasmid, pG117S4HGP

A plasmid which produces a chimeric protein (βGal-117S4HGP) containing a human glucagon-like peptide I
(hGLP-1 (7-37)), pG117S4HGP, was constructed according to the following procedure. At the outset, four DNAs shown in Fig.
14 (a) (GLP-1 (SEQ ID NO: 43), GLP-2 (SEQ ID NO: 44), GLP-3 (SEQ ID NO: 45), and GLP-4 (SEQ ID NO: 46)) were
synthesized. These DNAs were phosphorylated at the 5'-terminal with T4 polynucleotide kinase, mixed together, and
annealed to prepare a hGLP-1 (7-37) gene (DNA (I)). On the other hand, two synthetic DNAs (117S4H SphI primer
(SEQ ID NO: 47) and 117S4H BglII primer (SEQ ID NO: 48)) shown in Fig. 14 (b) were used as primers for PCR, and
PCR was performed using pGKSPH34 as a template. The resultant PCR product was digested with BglII and SphI to
prepare a DNA fragment having BglII and SphI sticky ends (DNA (II)).

Then, pGKSPH34 was digested with BglII and SalI and electrophoresed on a 1% agarose gel to recover a larger
DNA fragment (DNA (III)) from the gel. DNA (I), DNA (II), and DNA (III) thus obtained were ligated together using a T4
DNA ligase to prepare pG117S4HGP (Fig. 15).

Example 12 Production of chimeric protein βGal-117S4HGP

E. coli strain W3110M25 having the above-prepared pG117S4HGP was cultured in a medium containing 1.5%
glucose, 4 g/l K2HPO4, 4 g/l KH2PO4, 2.7 g/l Na2HPO4, 0.2 g/l NH4Cl, 1.2 g/l (NH4)2SO4, 4 g/l yeast extract, 2 g/l
L-methionine, 2 g/l MgSO4·7H2O, 40 mg/l CaCl2·2H2O, 40 mg/l FeSO4·7H2O, 10 mg/l MnSO4·nH2O, 10 mg/l AlCl3·6H2O,
4 mg/l CoCl2·6H2O, 2 mg/l ZnSO4·7H2O, 2 mg/l Na2MoO4·2H2O, 1 mg/l CuCl2·2H2O, 0.5 mg/l H3BO3, and 10 mg/l
tetracycline under conditions of 32°C and pH 7.0. After exhaustion of glucose, glycerin was used as the carbon source,
and culture was continued at 37°C for an additional 8 hr to produce inclusion bodies of βGal-117S4HGP in the bacterium.
After the completion of the culture, the culture was homogenized by means of a Manton-Gaulin homogenizer
(model 15M-8TA, manufactured by Manton-Gaulin) at 600 kg/cm2 and centrifuged (CEPA centrifuge) to recover a
precipitate fraction. The precipitate was suspended in a buffer (10 mM Tris-HCl (pH 8.0), 1 mM EDTA) in an amount
equal to that of the culture, again recovered by centrifugation, and suspended in deionized water in an amount equal
to that of the culture. Thereafter, centrifugation was again performed to recover the precipitate, which was then
suspended in deionized water in an amount equal to that of the precipitate and used in the subsequent procedure.

Example 13 Cleavage of hGLP-1 (7-37) from chimeric protein

The chimeric protein inclusion body prepared in Example 12 was recovered, dissolved in 8 M urea, and diluted to
give a reaction composition (20 mM Tris/HCl (pH 8.2), 100 mM NaCl, 3.0 M urea, 2.0 mM CaCl2, 8 mg/ml chimeric
protein). Kex2-660 was then added to a final concentration of 10,000 U/ml to cleave hGLP-1 (7-37). 90% or more
of βGal-117S4HGP as the control was cleaved under these conditions in one hr.

Example 14 Fractionation of hGLP-1 (7-37) (Fig. 16, Table 3)

In order to mass-produce the chimeric protein pGal-1 17S4HGP, which had been found to be expressed with a high 

20 productivity and to be suitable as a substrate for Kex2-660, the plasmid pG117S4HGP encoding this chimeric protein 
was transformed into E. coli strain M25 (W31 1 0/ompT), followed by culture in a 20 L culture tank. The medium comprised 
20 g/L yeast extract, 1 0 g/L bactotriptone, 1 2 g/L K 2 HP0 4 , 3 g/L KH 2 P0 4 , and 5 g/L glycerin. When the cell concentration 
reached OD660 = 1.0, IPTG (isopropyl β-thiogalactoside) was added to a final concentration of 1 mM. An antifoaming
agent (DISFOAM CC-222, manufactured by Nippon Oils & Fats Co., Ltd.) was used. The culture was continued at
37°C until the cell concentration reached OD660 = 12. The cells were harvested and suspended in TE (10 mM Tris/HCl,
1 mM EDTA (pH 8.0)). The suspension was subjected to high-pressure homogenization (Manton-Gaulin) to
disrupt the cells. The homogenate was centrifuged to obtain a precipitate containing the chimeric protein, which was then
washed with TE and deionized water. About 100 g of inclusion bodies containing the chimeric protein was obtained from
20 L of culture. To 250 ml of the inclusion body suspension (160 g/L) were added 100 ml of 1 M Tris/HCl (pH 8.2), 50 ml
of 5 M NaCl, 500 ml of deionized water, and 900 g of urea. The mixture was stirred for 30 min in a thermostatic
chamber kept at 30°C to dissolve the components, then diluted with warmed deionized water and brought to 5 L at 30°C.
50 ml of a 250 mM CaCl2 solution was slowly added while stirring, and Kex2-660 was added thereto to 20,000 U/ml
(about 4.0 mg/L). Two hours after the addition of Kex2-660 (percentage cleavage: not less than 90%), the pH was adjusted to
6.4 by the addition of acetic acid, followed by two-fold dilution with deionized water to coagulate and sediment the
unreacted chimeric protein and the protective peptide. Centrifugation was performed to obtain a supernatant
containing hGLP-1 (7-37). The supernatant was adjusted to pH 5.0 by the addition of acetic acid and passed through
a cation exchange resin (SP Toyopearl) column equilibrated with a 2 M urea/10 mM sodium acetate buffer (pH 5.0) to
adsorb hGLP-1 (7-37), followed by washing with a 2 M urea/10 mM sodium acetate buffer (pH 6.2) and elution of
hGLP-1 (7-37) with a sodium chloride concentration gradient from 0 M to 300 mM. Acetic acid was placed in the fraction
tubes beforehand so as to give a final concentration of 3%, preventing the coagulation of hGLP-1 (7-37). A fraction
containing hGLP-1 (7-37) was collected and passed through a low-pressure reversed phase column (Soken ODS-W)
equilibrated with 5% acetic acid to adsorb hGLP-1 (7-37), followed by elution of hGLP-1 (7-37) with a 50% acetonitrile
solution containing 5% acetic acid.
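The composition of the cleavage reaction mixture follows from the amounts stated above. The following is a minimal sketch of that arithmetic, assuming that the volumes are additive, that the mixture is exactly 5 L before the CaCl2 solution is added, and that the molar mass of urea is 60.06 g/mol (a standard value, not taken from this description). The helper name `diluted` is illustrative only.

```python
# Final concentrations in the Kex2-660 reaction mixture, computed from the
# amounts stated in the text. UREA_MW is a standard value (assumption: not
# given in the source); volumes are assumed to be additive.
UREA_MW = 60.06          # g/mol
FINAL_VOLUME_L = 5.0     # mixture is brought to 5 L before CaCl2 is added


def diluted(stock, added_volume_l, final_volume_l=FINAL_VOLUME_L):
    """Concentration after diluting `added_volume_l` of `stock` to the final volume."""
    return stock * added_volume_l / final_volume_l


tris_mM = diluted(1000.0, 0.100)            # 100 ml of 1 M Tris/HCl (pH 8.2)
nacl_mM = diluted(5000.0, 0.050)            # 50 ml of 5 M NaCl
inclusion_g_per_l = diluted(160.0, 0.250)   # 250 ml of 160 g/L suspension
urea_M = 900.0 / UREA_MW / FINAL_VOLUME_L   # 900 g of solid urea
# The CaCl2 solution is added to the 5 L mixture, giving 5.05 L in total.
cacl2_mM = diluted(250.0, 0.050, FINAL_VOLUME_L + 0.050)

print(f"Tris/HCl          {tris_mM:.1f} mM")
print(f"NaCl              {nacl_mM:.1f} mM")
print(f"Urea              {urea_M:.2f} M")
print(f"CaCl2             {cacl2_mM:.2f} mM")
print(f"Inclusion bodies  {inclusion_g_per_l:.1f} g/L")
```

Under these assumptions the mixture contains roughly 20 mM Tris/HCl, 50 mM NaCl, 3 M urea, 2.5 mM CaCl2, and 8 g/L of inclusion bodies.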

Acetonitrile was removed under reduced pressure, and the residue was filtered through a 0.22-µm membrane filter
and lyophilized.

The quantitative determination of hGLP-1 (7-37) will now be described in detail. A solution containing hGLP-1 (7-37)
is subjected to 10- to 20-fold dilution with a sample buffer (1 M acetate/2 M urea), and the diluted solution is centrifuged
to obtain a supernatant. The supernatant is then applied to an HPLC system (LC10A, manufactured by Shimadzu Seisakusho
Ltd.) to which a YMC-C8-A302 column (4.6 mm I.D. x 150 mm, manufactured by K.K. Yamamura Kagaku Kenkyujo)
has been connected, and 10 to 20 µl of the supernatant is developed using liquid A (0.1% TFA/10% acetonitrile)
and liquid B (0.094% TFA/60% acetonitrile) with a gradient of B 40% → 80%/20 min. The amount of hGLP-1 (7-37)
is determined from the ratio of the peak area for hGLP-1 (7-37) to the peak area for standard hGLP-1 (7-37) having a
known concentration, the peaks being detected by measuring absorbance at 220 nm. The recovery is given in
Table 3.
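The peak-area quantification and the gradient described above can be sketched as follows. This is an illustration only: it assumes a linear gradient, a linear detector response, and equal injection volumes for sample and standard, none of which is stated explicitly in the description. The function names and the numerical inputs in the usage lines are hypothetical.

```python
def percent_b(t_min, start=40.0, end=80.0, duration=20.0):
    """Percentage of liquid B at time t for a linear gradient B 40% -> 80% over 20 min."""
    t = min(max(t_min, 0.0), duration)  # clamp to the gradient window
    return start + (end - start) * t / duration


def acetonitrile_percent(pct_b, in_a=10.0, in_b=60.0):
    """Acetonitrile content of the mobile phase (liquid A: 10%, liquid B: 60%)."""
    return in_a + (in_b - in_a) * pct_b / 100.0


def hglp1_concentration(sample_area, standard_area, standard_conc, dilution_factor):
    """Single-point external-standard estimate from the ratio of peak areas.

    Assumes equal injection volumes and a linear response at 220 nm.
    The result is in the units of `standard_conc`.
    """
    return sample_area / standard_area * standard_conc * dilution_factor


# Hypothetical example values (not from the source):
conc = hglp1_concentration(sample_area=1200.0, standard_area=1000.0,
                           standard_conc=0.05, dilution_factor=20.0)
print(f"%B at 10 min: {percent_b(10.0):.0f}")
print(f"Acetonitrile at start/end: "
      f"{acetonitrile_percent(40.0):.0f}% / {acetonitrile_percent(80.0):.0f}%")
print(f"Estimated concentration: {conc:.2f} (units of the standard)")
```

With liquid A at 10% and liquid B at 60% acetonitrile, a gradient of B 40% → 80% corresponds to 30% → 50% acetonitrile in the mobile phase.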

Table 3: Recovery of hGLP-1 (7-37)

Steps                                                Volume [L]   hGLP-1 [g]   Recovery [%]
Kex2-660 reaction                                    4.0          4.8          100
Supernatant obtained upon precipitation with acid    7.8          3.3          68
Cation exchange column                               0.5          2.8          60
Low-pressure reversed phase column                                2.0          40
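The recovery column of Table 3 is the amount of hGLP-1 (7-37) at each step relative to the amount present after the Kex2-660 reaction. A minimal sketch of that calculation is given below; the step-to-step yield is an additional quantity computed here for illustration and does not appear in the table. The computed overall values (about 69%, 58%, and 42%) differ slightly from the tabulated 68%, 60%, and 40%, presumably because the tabulated figures were rounded or derived from unrounded masses.

```python
# hGLP-1 (7-37) amounts in grams, taken from Table 3.
steps = [
    ("Kex2-660 reaction", 4.8),
    ("Supernatant obtained upon precipitation with acid", 3.3),
    ("Cation exchange column", 2.8),
    ("Low-pressure reversed phase column", 2.0),
]

initial = steps[0][1]
# Overall recovery relative to the Kex2-660 reaction mixture.
overall = [100.0 * grams / initial for _, grams in steps]
# Yield of each step relative to the preceding step (not part of Table 3).
stepwise = [100.0] + [
    100.0 * steps[i][1] / steps[i - 1][1] for i in range(1, len(steps))
]

for (name, grams), o, s in zip(steps, overall, stepwise):
    print(f"{name:<52}{grams:>5.1f} g {o:>7.1f}% overall {s:>7.1f}% of previous step")
```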
SEQUENCE LISTING

SEQ ID NO: 1 

SEQUENCE LENGTH: 40 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA 

SEQUENCE 

AATTCATGAA ATCTGTTAAA AAGCGTTCTG TTTCTGAAAT 

SEQ ID NO: 2 

SEQUENCE LENGTH: 41 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCAGCTGATG CATAACCTGG GCAAACACCT GAATAGCATG G 

SEQ ID NO: 3

SEQUENCE LENGTH: 41 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AACGCGTCGA GTGGCTGCGT AAGAAACTGC AGGACGTCCA C 

SEQ ID NO: 4 

SEQUENCE LENGTH: 41 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AACTTCGTTG CGCTGGGTGC ACCGCTGGCT CCACGTGATG C 

SEQ ID NO: 5 
SEQUENCE LENGTH: 39 
SEQUENCE TYPE: Nucleic acid 
STRANDEDNESS: Single strand
TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AGGATCCCAA CGTCCGCGTA AGAAAGAAGA TAACGTACT 

SEQ ID NO: 6 

SEQUENCE LENGTH: 40

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGTTGAATCT CATGAGAAAT CCCTGGGCGA AGCTGACAAA 

SEQ ID NO: 7 

SEQUENCE LENGTH: 40

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GCCGATGTTA ACGTGCTGAC CAAAGCGAAA AGCCAGTAAG 

SEQ ID NO: 8 

SEQUENCE LENGTH: 33

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCGACTTACT GGCTTTTCGC TTTGGTCAGC ACG 

SEQ ID NO: 9 

SEQUENCE LENGTH: 40

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TTAACATCGG CTTTGTCAGC TTCGCCCAGG GATTTCTCAT 
SEQ ID NO: 10

SEQUENCE LENGTH: 40

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GAGATTCAAC CAGTACGTTA TCTTCTTTCT TACGCGGACG 

SEQ ID NO: 11 

SEQUENCE LENGTH: 41 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TTGGGATCCT GCATCACGTG GAGCCAGCGG TGCACCCAGC G 

SEQ ID NO: 12 

SEQUENCE LENGTH: 41 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

CAACGAAGTT GTGGACGTCC TGCAGTTTCT TACGCAGCCA C 

SEQ ID NO: 13 

SEQUENCE LENGTH: 41 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCGACGCGTT CCATGCTATT CAGGTGTTTG CCCAGGTTAT G 

SEQ ID NO: 14 

SEQUENCE LENGTH: 46 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE

CATCAGCTGA ATTTCAGAAA CAGAACGCTT TTTAACAGAT TTCATG 

SEQ ID NO: 15 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCGAGGTCGA CGGTACCGAG CTCG 

SEQ ID NO: 16 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AATTCGAGCT CGGTACCGTC GACC 

SEQ ID NO: 17 

SEQUENCE LENGTH: 45

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AAACTGCAGG ACGTCCACAA CTTCTAAGCG CTGGGTGCAC CGCGT 

SEQ ID NO: 18 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

CATTAAAGCT TTGCGATGAT AAGC 
SEQ ID NO: 19 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE: Nucleic acid
STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

CGCACCGATC GCCCTTCCCA ACAGTT 

SEQ ID NO: 20

SEQUENCE LENGTH: 35 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TTTCCCGGGC CTCCGTGGGA ACAAACGGCG GATTG 

SEQ ID NO: 21 

SEQUENCE LENGTH: 42

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TTTCCCGGGA GGCCTTCTGT TAAAAAGCGG TCTGTTTCTG AA 

SEQ ID NO: 22

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

CACCATCATC ACCCTGGA 

SEQ ID NO: 23 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCCAGGGTGA TGATGGTG

SEQ ID NO: 24

SEQUENCE LENGTH: 36 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

CACCACCATC ATCCTGGACA CCATCATCAC CCTGGA 

SEQ ID NO: 25 

SEQUENCE LENGTH: 36 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCCAGGGTGA TGATGGTGTC CAGGATGATG GTGGTG 

SEQ ID NO: 26 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AAGAAGAAGA AGCCTGGA 

SEQ ID NO: 27 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCCGGACTTC TTCTTCTT 

SEQ ID NO: 28 
SEQUENCE LENGTH: 36 
SEQUENCE TYPE: Nucleic acid 
STRANDEDNESS: Single strand 
TOPOLOGY: Linear
MOLECULE TYPE: Synthetic DNA
SEQUENCE

GGGAGGCCTT CTGTTNCAAA GCTTTCTGTT TCTGAA 

SEQ ID NO: 29 

SEQUENCE LENGTH: 36 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTGTTNATAA GCTTTCTGTT TCTGAA 

SEQ ID NO: 30 

SEQUENCE LENGTH: 36

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTGTTNTTAA GCTTTCTGTT TCTGAA 

SEQ ID NO: 31 

SEQUENCE LENGTH: 36 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTGTTBGGAA GCTTTCTGTT TCTGAA 

SEQ ID NO: 32 

SEQUENCE LENGTH: 36 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTGTTSAAAA GCTTTCTGTT TCTGAA 

SEQ ID NO: 33 
SEQUENCE LENGTH: 33
SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTADTCATAA GCTTTCTGTT TCT 

SEQ ID NO: 34 

SEQUENCE LENGTH: 33

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTAHGCATAA GCTTTCTGTT TCT 

SEQ ID NO: 35 

SEQUENCE LENGTH: 33

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTGHGCATAA GCTTTCTGTT TCT 

SEQ ID NO: 36 

SEQUENCE LENGTH: 33

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTTKKCATAA GCTTTCTGTT TCT

SEQ ID NO: 37 

SEQUENCE LENGTH: 34 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE

GGGAGGCCTT CTCVACATAA GCTTTCTGTT TCTG

SEQ ID NO: 38 

SEQUENCE LENGTH: 34 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GGGAGGCCTT CTBATCATAA GCTTTCTGTT TCTG 

SEQ ID NO: 39 

SEQUENCE LENGTH: 27

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

AATTAATGTG AGTTAGCTCA CCATTAG 

SEQ ID NO: 40 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

ATCCCAGTCT TTAGCTTGTA AAAC 

SEQ ID NO: 41

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GTTTTACAAC GTAAAGACTG GGAT 

SEQ ID NO: 42 
SEQUENCE LENGTH: 31 
SEQUENCE TYPE: Nucleic acid 
STRANDEDNESS: Single strand
TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TTTAGATCTG ACTCCAGCAA GCTGTCCGGC A 

SEQ ID NO: 43 

SEQUENCE LENGTH: 57 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE

CGGAAGGTAC CTTTACCAGC GATGTGAGCT CGTATCTGGA AGGTCAGGCG GCAAAAG

SEQ ID NO: 44 

SEQUENCE LENGTH: 48 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

ACCTTCCAGA TACGAGCTCA CATCGCTGGT AAAGGTACCT TCCGCATG 

SEQ ID NO: 45 

SEQUENCE LENGTH: 36 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE

AATTCATCGC GTGGCTGGTG AAAGGCCGTG GTTAAG

SEQ ID NO: 46

SEQUENCE LENGTH: 53 

SEQUENCE TYPE: Nucleic acid 

STRANDEDNESS: Single strand 

TOPOLOGY: Linear 

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TCGACTTAAC CACGGCCTTT CACCAGCCAC GCGATGAATT CTTTTGCCGC CTG 
SEQ ID NO: 47

SEQUENCE LENGTH: 38

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

TGAATTTCAG AAGCATGCCG CTTATGTCGA GAAGGCCT 

SEQ ID NO: 48

SEQUENCE LENGTH: 24

SEQUENCE TYPE: Nucleic acid

STRANDEDNESS: Single strand

TOPOLOGY: Linear

MOLECULE TYPE: Synthetic DNA

SEQUENCE 

GACTCAGATC TTCCTGAGGC CGAT 
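Several of the synthetic oligonucleotides listed above form complementary pairs, and SEQ ID NO: 22 and 24 read, codon by codon, as one and two copies of (His)4-Pro-Gly. The sketch below checks both points for SEQ ID NO: 22 to 25. It is an illustration only: the reading frame is assumed to start at the first base, the codon table is deliberately limited to the four codons that occur in these oligonucleotides, and the function names are not taken from the source.

```python
# Complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Only the codons occurring in SEQ ID NO: 22 and 24 (standard genetic code).
CODONS = {"CAC": "His", "CAT": "His", "CCT": "Pro", "GGA": "Gly"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate in frame 1 (assumed) using the restricted codon table."""
    return "-".join(CODONS[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Sequences copied from the listing, with the spacing removed.
seq22 = "CACCATCATCACCCTGGA"
seq23 = "TCCAGGGTGATGATGGTG"
seq24 = "CACCACCATCATCCTGGACACCATCATCACCCTGGA"
seq25 = "TCCAGGGTGATGATGGTGTCCAGGATGATGGTGGTG"

print(reverse_complement(seq22) == seq23)  # SEQ ID NO: 23 pairs with 22
print(reverse_complement(seq24) == seq25)  # SEQ ID NO: 25 pairs with 24
print(translate(seq22))
print(translate(seq24))
```

Read this way, SEQ ID NO: 22 encodes His-His-His-His-Pro-Gly and SEQ ID NO: 24 encodes the same unit twice.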
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: 

(A) NAME: Suntory Limited 

(B) STREET: 1-40, Dojimahama 2-chome, Kita-ku, Osaka-shi

(C) CITY: Osaka 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP): 530 Japan 

(ii) TITLE OF INVENTION: Method for cleaving chimeric protein using 
processing enzyme 

(iii) NUMBER OF SEQUENCES: 48 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 97301407.9 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 8-70906 

(B) FILING DATE: 04-MAR-1996 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
AATTCATGAA ATCTGTTAAA AAGCGTTCTG TTTCTGAAAT 40
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCAGCTGATG CATAACCTGG GCAAACACCT GAATAGCATG G 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
AACGCGTCGA GTGGCTGCGT AAGAAACTGC AGGACGTCCA C 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
AACTTCGTTG CGCTGGGTGC ACCGCTGGCT CCACGTGATG C 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGGATCCCAA CGTCCGCGTA AGAAAGAAGA TAACGTACT 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGTTGAATCT CATGAGAAAT CCCTGGGCGA AGCTGACAAA
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GCCGATGTTA ACGTGCTGAC CAAAGCGAAA AGCCAGTAAG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
TCGACTTACT GGCTTTTCGC TTTGGTCAGC ACG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTAACATCGG CTTTGTCAGC TTCGCCCAGG GATTTCTCAT 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GAGATTCAAC CAGTACGTTA TCTTCTTTCT TACGCGGACG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
TTGGGATCCT GCATCACGTG GAGCCAGCGG TGCACCCAGC G
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAACGAAGTT GTGGACGTCC TGCAGTTTCT TACGCAGCCA C 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCGACGCGTT CCATGCTATT CAGGTGTTTG CCCAGGTTAT G 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CATCAGCTGA ATTTCAGAAA CAGAACGCTT TTTAACAGAT TTCATG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
TCGAGGTCGA CGGTACCGAG CTCG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AATTCGAGCT CGGTACCGTC GACC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAACTGCAGG ACGTCCACAA CTTCTAAGCG CTGGGTGCAC CGCGT 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CATTAAAGCT TTGCGATGAT AAGC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGCACCGATC GCCCTTCCCA ACAGTT 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTTCCCGGGC CTCCGTGGGA ACAAACGGCG GATTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
TTTCCCGGGA GGCCTTCTGT TAAAAAGCGG TCTGTTTCTG AA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CACCATCATC ACCCTGGA 18 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
TCCAGGGTGA TGATGGTG 18 
(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CACCACCATC ATCCTGGACA CCATCATCAC CCTGGA 36 
(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCCAGGGTGA TGATGGTGTC CAGGATGATG GTGGTG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AAGAAGAAGA AGCCTGGA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TCCGGACTTC TTCTTCTT 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






EP 0 794 255 A2 



(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GGGAGGCCTT CTGTTNCAAA GCTTTCTGTT TCTGAA 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
GGGAGGCCTT CTGTTNATAA GCTTTCTGTT TCTGAA
(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGGAGGCCTT CTGTTNTTAA GCTTTCTGTT TCTGAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GGGAGGCCTT CTGTTBGGAA GCTTTCTGTT TCTGAA 36 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GGGAGGCCTT CTGTTSAAAA GCTTTCTGTT TCTGAA 36 
(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GGGAGGCCTT CTADTCATAA GCTTTCTGTT TCT 33 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
GGGAGGCCTT CTAHGCATAA GCTTTCTGTT TCT 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGGAGGCCTT CTGHGCATAA GCTTTCTGTT TCT 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GGGAGGCCTT CTTKKCATAA GCTTTCTGTT TCT 
(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GGGAGGCCTT CTCVACATAA GCTTTCTGTT TCTG 

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGGAGGCCTT CTBATCATAA GCTTTCTGTT TCTG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AATTAATGTG AGTTAGCTCA CCATTAG 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
ATCCCAGTCT TTAGCTTGTA AAAC 24 
(2) INFORMATION FOR SEQ ID NO:41: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
GTTTTACAAC GTAAAGACTG GGAT 24 
(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:
TTTAGATCTG ACTCCAGCAA GCTGTCCGGC A 31

(2) INFORMATION FOR SEQ ID NO:43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CGGAAGGTAC CTTTACCAGC GATGTGAGCT CGTATCTGGA AGGTCAGGCG GCAAAAG 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
ACCTTCCAGA TACGAGCTCA CATCGCTGGT AAAGGTACCT TCCGCATG
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
AATTCATCGC GTGGCTGGTG AAAGGCCGTG GTTAAG 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TCGACTTAAC CACGGCCTTT CACCAGCCAC GCGATGAATT CTTTTGCCGC CTG 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
TGAATTTCAG AAGCATGCCG CTTATGTCGA GAAGGCCT 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GACTCAGATC TTCCTGAGGC CGAT 
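SEQ ID NO: 28 to 38 contain the letters N, B, S, D, H, K, and V. Read as standard IUPAC nucleotide ambiguity codes, each such oligonucleotide stands for a small mixture of defined sequences. The sketch below expands two of them and checks the stated lengths. It assumes that these letters are indeed IUPAC codes (the listing itself does not say so), and the function name `expand` is illustrative.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand(seq):
    """Return every unambiguous sequence represented by a degenerate sequence."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in seq))]


# Sequences copied from the listing, with the spacing removed.
seq31 = "GGGAGGCCTT CTGTTBGGAA GCTTTCTGTT TCTGAA".replace(" ", "")
seq36 = "GGGAGGCCTT CTTKKCATAA GCTTTCTGTT TCT".replace(" ", "")

print(len(seq31), len(seq36))  # stated lengths: 36 and 33 base pairs

for variant in expand(seq31):  # one degenerate position (B)
    print(variant)
for variant in expand(seq36):  # two degenerate positions (K, K)
    print(variant)
```

Under this reading, SEQ ID NO: 31 represents three sequences and SEQ ID NO: 36 represents four.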

Claims 

1. A chimeric protein represented by the following formula:

A-L-B

wherein A represents a protective peptide;
B represents a target peptide; and
L represents a linker peptide having the sequence X1-X2-(Pro, Lys, or Arg)-Arg, wherein X1 and X2 represent
any amino acid, in its C-terminal region and a domain rich in His in its N-terminal region.

2. The chimeric protein according to claim 1, wherein the domain rich in His in the N-terminal region has the sequence:
[(His)4-Pro-Gly]n wherein n is 1 to 6.

3. The chimeric protein according to claim 1 or 2, wherein any 1 to 5 amino acids exist between the amino acid
sequence in the N-terminal region and the amino acid sequence in the C-terminal region.

4. The chimeric protein according to any one of claims 1 to 3, wherein X1 represents Arg, Lys, or His and X2 represents
His or Phe.

5. The chimeric protein according to any one of claims 1 to 4, wherein A represents a polypeptide not containing the
amino acid sequence: Pro-Arg, Arg-Arg, and Lys-Arg.

6. The chimeric protein according to any one of claims 1 to 4, wherein when A represents a polypeptide containing
the amino acid sequence: Pro-Arg, Arg-Arg, or Lys-Arg, at least one amino acid in said amino acid sequence has
been substituted with a different amino acid so as for A not to contain said amino acid sequence.

7. The chimeric protein according to any one of claims 1 to 6, wherein A represents a polypeptide of not more than 
220 amino acid residues. 

8. The chimeric protein according to claim 7, wherein A represents a peptide of amino acids 1-97, 1-117, or 1-139
on the N-terminal side of β-galactosidase derived from E. coli.

9. The chimeric protein according to any one of claims 1 to 8, wherein any cysteine residue in A has been substituted
with a different amino acid.

10. The chimeric protein according to claim 9, wherein the different amino acid is a serine residue.

11. The chimeric protein according to claim 1, wherein B represents a human parathyroid hormone or a derivative 
thereof. 

12. The chimeric protein according to claim 11, wherein the human parathyroid hormone derivative comprises a fragment
comprising amino acid residues from 1 or 3 to 31, 34, 37, 38, or 84 in the N-terminal of a naturally occurring
human parathyroid hormone, or a fragment comprising said amino acid residues with Gly being further added to
the C-terminal.

13. The chimeric protein according to claim 1, wherein B represents GLP-1 or a derivative thereof.



14. A process for producing the chimeric protein according to any one of claims 1 to 13, comprising the steps of:
transforming an expression vector containing DNA encoding the chimeric protein into a host cell; culturing the
resultant transformants; and harvesting the chimeric protein from the culture.

15. A process for producing a target peptide (B), wherein a processing enzyme is allowed to act on the chimeric protein 
according to any one of claims 1 to 13 to cleave a peptide bond between the C-terminal of the linker peptide (L) 
and the N-terminal of the target peptide (B) to obtain the target peptide (B). 

16. The process according to claim 15, wherein the processing enzyme is a member selected from Kex2 protease,
furin, a prohormone convertase 1/3 (PC1/3), a prohormone convertase 2 (PC2), and derivatives of said processing
enzymes.

17. The process according to claim 15 or 16, wherein the target peptide (B) is recovered by isoelectric precipitation.
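The sequence constraints recited in claims 1, 2, and 5 can be expressed as simple checks on one-letter amino acid strings. The sketch below is an illustration under stated assumptions, not part of the claimed subject matter: the example linker and the protective-peptide string are hypothetical, the function names are not taken from the source, and only the 20 standard amino acids are considered.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Claim 1: X1-X2-(Pro, Lys, or Arg)-Arg at the C-terminal end of the linker.
CLEAVAGE_MOTIF = re.compile(rf"[{AMINO_ACIDS}]{{2}}[PKR]R$")

# Claim 5: dipeptides that the protective peptide A should not contain.
FORBIDDEN_DIPEPTIDES = ("PR", "RR", "KR")


def his_rich_domain(n):
    """Claim 2: [(His)4-Pro-Gly]n with n from 1 to 6."""
    if not 1 <= n <= 6:
        raise ValueError("n must be between 1 and 6")
    return "HHHHPG" * n


def has_cleavage_motif(linker):
    """True if the linker ends in X1-X2-(P/K/R)-R."""
    return CLEAVAGE_MOTIF.search(linker) is not None


def forbidden_sites(protective_peptide):
    """Return (0-based position, dipeptide) for every Pro-Arg, Arg-Arg, or Lys-Arg."""
    return [
        (i, protective_peptide[i:i + 2])
        for i in range(len(protective_peptide) - 1)
        if protective_peptide[i:i + 2] in FORBIDDEN_DIPEPTIDES
    ]


# Hypothetical linker: one His-rich unit, one spacer residue, then Arg-His-Lys-Arg.
linker = his_rich_domain(1) + "G" + "RHKR"
print(linker, has_cleavage_motif(linker))

# Hypothetical protective peptide containing all three forbidden dipeptides.
print(forbidden_sites("MTMSRRDLKRAPRG"))
```

A protective peptide for which `forbidden_sites` returns an empty list would satisfy the condition of claim 5 as read here; a non-empty list indicates positions at which a substitution in the sense of claim 6 would be needed.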